- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources4
- Resource Type
-
0003000001000000
- More
- Availability
-
40
- Author / Contributor
- Filter by Author / Creator
-
-
Cao, Tianyu (4)
-
Barradas, Diogo (1)
-
Chen, Ang (1)
-
Dervovic, Danial (1)
-
Gattani, Aniket (1)
-
Gorte, Raymond J. (1)
-
Jiang, Haoming (1)
-
Kon, Patrick (1)
-
McIntosh, Steven (1)
-
Paige, Julian M. (1)
-
Raman, Natraj (1)
-
Sherr, Micah (1)
-
Tan, Chenhao (1)
-
Ujcich, Benjamin (1)
-
Vohs, John M. (1)
-
Vu, Duytam (1)
-
Yin, Bing (1)
-
Zhang, Danqing (1)
-
Zhao, Tuo (1)
-
#Tyler Phillips, Kenneth E. (0)
-
- Filter by Editor
-
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
(submitted - in Review for IEEE ICASSP-2024) (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
As large language models (LLMs) expand the power of natural language processing to handle long inputs, rigorous and systematic analyses are necessary to understand their abilities and behavior. A salient application is summarization, due to its ubiquity and controversy (e.g., researchers have declared the death of summarization). In this paper, we use financial report summarization as a case study because financial reports are not only long but also use numbers and tables extensively. We propose a computational framework for characterizing multimodal long-form summarization and investigate the behavior of Claude 2.0/2.1, GPT-4/3.5, and Cohere. We find that GPT-3.5 and Cohere fail to perform this summarization task meaningfully. For Claude 2 and GPT-4, we analyze the extractiveness of the summary and identify a position bias in LLMs. This position bias disappears after shuffling the input for Claude, which suggests that Claude seems to recognize important information. We also conduct a comprehensive investigation on the use of numeric data in LLM-generated summaries and offer a taxonomy of numeric hallucination. We employ prompt engineering to improve GPT-4's use of numbers with limited success. Overall, our analyses highlight the strong capability of Claude 2 in handling long multimodal inputs compared to GPT-4. The generated summaries and evaluation code are available at https://github.com/ChicagoHAI/characterizing-multimodal-long-form-summarization.more » « less
-
Kon, Patrick; Gattani, Aniket; Cao, Tianyu; Barradas, Diogo; Chen, Ang; Sherr, Micah; Ujcich, Benjamin (, IEEE Security and Privacy)
-
Jiang, Haoming; Zhang, Danqing; Cao, Tianyu; Yin, Bing; Zhao, Tuo (, Annual Meeting of the Association for Computational Linguistics)
-
Paige, Julian M.; Vu, Duytam; Cao, Tianyu; McIntosh, Steven; Gorte, Raymond J.; Vohs, John M. (, Journal of The Electrochemical Society)
An official website of the United States government

Full Text Available